ExTaSem! Extending, Taxonomizing and Semantifying Domain Terminologies

Authors

  • Luis Espinosa Anke
  • Horacio Saggion
  • Francesco Ronzano
  • Roberto Navigli
Abstract

We introduce EXTASEM!, a novel approach for the automatic learning of lexical taxonomies from domain terminologies. First, we exploit a very large semantic network to collect thousands of in-domain textual definitions. Second, we extract (hyponym, hypernym) pairs from each definition with a CRF-based algorithm trained on manually-validated data. Finally, we introduce a graph induction procedure which constructs a full-fledged taxonomy where each edge is weighted according to its domain pertinence. EXTASEM! achieves state-of-the-art results in the following taxonomy evaluation experiments: (1) Hypernym discovery, (2) Reconstructing gold standard taxonomies, and (3) Taxonomy quality according to structural measures. We release weighted taxonomies for six domains for the use and scrutiny of the community.
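
The abstract does not spell out the graph induction procedure, so the following is only a minimal sketch of the general idea of a taxonomy whose edges carry weights. It assumes (hyponym, hypernym) pairs have already been extracted and scored; the function names `build_taxonomy` and `best_path_to_root`, the toy terms, and the weights are all hypothetical and are not taken from the paper. It simply follows the highest-weighted hypernym edge from a term upward, which may differ from what EXTASEM! actually does.

```python
from collections import defaultdict


def build_taxonomy(pairs):
    """Map each hyponym to its candidate hypernyms and their weights.

    `pairs` is an iterable of (hyponym, hypernym, weight) triples.
    The weights stand in for a domain-pertinence score (an assumption).
    """
    graph = defaultdict(dict)
    for hyponym, hypernym, weight in pairs:
        # Keep the highest weight seen if the same edge appears twice.
        previous = graph[hyponym].get(hypernym, 0.0)
        graph[hyponym][hypernym] = max(weight, previous)
    return dict(graph)


def best_path_to_root(graph, term):
    """Follow the highest-weighted hypernym edge until none remains."""
    path = [term]
    seen = {term}
    while term in graph:
        term = max(graph[term], key=graph[term].get)
        if term in seen:  # stop if the edges form a cycle
            break
        path.append(term)
        seen.add(term)
    return path


# Toy, hand-made pairs for illustration only (not data from the paper).
pairs = [
    ("espresso", "coffee", 0.9),
    ("espresso", "machine", 0.2),
    ("coffee", "beverage", 0.8),
    ("beverage", "food", 0.6),
]
taxonomy = build_taxonomy(pairs)
print(best_path_to_root(taxonomy, "espresso"))
```

Keeping every candidate hypernym with its weight, rather than discarding the low-scoring ones early, leaves room to prune or re-rank edges later.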

Similar resources

Towards an infrastructure for semantic applications: Methodologies for semantic integration of heterogeneous resources

As with many domains, information retrieval and knowledge management (IR/KM) in agriculture suffers from the problems of semantic heterogeneity, making it difficult for providers to disseminate their services effectively and for users to retrieve the information they need. Based on the analysis of resources in the domain of agriculture, this paper proposes a) application profiles for dealing wi...

Constructing and Evaluating Controlled Bilingual Terminologies

This paper presents the construction and evaluation of Japanese and English controlled bilingual terminologies that are particularly intended for controlled authoring and machine translation with special reference to the Japanese municipal domain. Our terminologies are constructed by extracting terms from municipal website texts, and the term variations are controlled by defining preferred and ...

Ontology Merging Using Belief Revision and Defeasible Logic Programming

We combine argumentation, belief revision and description logic ontologies for extending the δ-ontologies framework to show how to merge two ontologies in which the union of the strict terminologies could lead to inconsistency. To solve this problem, we revisit a procedure presented by Falappa et al. in which part of the offending terminologies are turned defeasible by using a kernel revision o...

In silico structural analysis of quorum sensing genes in Vibrio fischeri

Quorum sensing controls the luminescence of Vibrio fischeri through the transcriptional activator LuxR and the specific autoinducer signal produced by luxI. Amino acid sequences of these two genes were analyzed using bioinformatics tools. LuxI consists of 193 amino acids and appears to contain five α-helices and six β-sheets when analyzed by SSpro8. LuxI belongs to the autoinducer synthetase fa...

Harmonizing and extending standards from a domain-specific and bottom-up approach: an example from development through use in clinical applications

OBJECTIVE: Currently, the processes for harmonizing and extending standards by leveraging the knowledge within local documentation artifacts are not well described. We describe a collaborative project to develop common information models, terminology bindings, and term definitions based on nursing documentation systems, and carry the findings through to the adoption in standards development orga...

Publication date: 2016